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ABSTRACT 

This document reviews "The Collected Works of George 
E. P. Box," edited by G. C. Tiao. The two-volume collection is 
divided into five parts. Each. part is prefaced with an introduction 
by a statistical researcher giving his view of the motivation and 
highlights of the papers presented. The 16 papers in Part 1 on 
statistical inference, robustness and modeling strategy are preceded 
by a summary by S. M. Stigler. Part 2 features 14 papers on 
experimental design and response surface methodology, introduced by 
B. H. Margolin, explaining Box's explorations of response surfaces, 
in Part 3, Box's efforts in time series analysis and forecasting are 
summarized by C. W. J. Granger. The set of papers in Part 4 is a 
collection of Box's works in the derivation of distributions, 
transformations, functional relationships and non-linear models. I. 
Guttman presents a comprehensive summary of the included 13 papers. 
Part 5 features 10 papers on the application of statistics. The 
introduction by R. D. Snee discusses Box's expository clarity. This 
study of Box's contributions indicated simultaneous progress on many 
fronts, with the bulk exhibiting a combination of theoretical and 
methodological aspects geared towards solving problems in technology 
and applied science. (PN) 3 
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OPERA SELECTA BOXI 
by 

Seymour Geisser 1 
University of Minnesota 

0. The collected works of G.E.P. Box edited by G.C. Tiao (1984) features 69 
articles out of a totality of 120 articles and 6 books attributed to him and co- 
authors during the period 19*7-1984. All of his books and more than 2/3 of his 
papers were collaborative efforts with a wide variety of statisticians, probably 
a larger number than any other statistician during the last 40 years ** so much 
for the statistics. 

Without a doubt the responsibility for the prominence of the Wisconsin 
Statistics Department, many of whose members or now former members have been his 
major collaborators, is mainly due to the efforts of Box. Most frequent as co- 
workers have been G.C. Tiao, M.R. Draper, G. Jenkins and both Hunters (J.S. & 
W.G.), G. LJung, B. Abraham, and J.F. MacGregor. This is not to gainsay the 
Important papers he wrote with D.R. Cox, P.W. Tidwell, s«L. Andersen, I. 
Guttman, K.B. Wilson, H.L. Lucas, D.A» Pierce, and a number of others. 

Clearly, Box exhibits an enormous capacity for simultaneously inspiring and 
working closely with a number of different researchers on a variety of 
statistical Issues no mean feat, given the history of statistical egos, 
polemics, and assorted petty quarrels. For example, from the late 1950 f s to the 
mid 1960*s, he must have been working more or less during the same period with 
Jenkins on control problems and time series, with Tiao on Bayesian inference, 
with J.S. Hunter on factorial designs, with Draper on response surfaces, with 
D.rf. Behnken on rotatable designs, with W.G. Hunter on modeling, with G.S. 
Watson on robustness, and with Tidwell and Cox individually on transformations. 
Indeed, before anything else is said, one crucial role Box has played is as "The 
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Great Collaborator" — of course not of the Quisling variety. Later we shall 
hear of him as "The Great Communicator" (In the sense of mastery of exposition 
rather than actor transmuted Into President), 

The "Collected Works" appear In 2 volumes divided Into 5 parts. Volume I, 
Part 1 contains 16 papers on Statistical Inference, Robustness, and Modeling 
Strategy, while Part 2 features 1* papers on Experimental Design and Response 
Surface Methodology, in Volume II, the remaining parts are: 3, Time Series 
Analysis and Forecasting; Distribution Theory, Transformation of Variables 
and Non-Linear Estimation; 5, Application of Statistics. Each part is prefaced 
with an introduction by a distinguished figure in the field giving his view of 
the motivation and highlights of the more important papers presented. 
Appropriately distributing a large number of an individual^ works into a few 
mutually exclusive categories presents difficulties, but here a sensible 
allocation was made. Within each part the papers are arranged chronologically 
by publication date. Several minor exceptions may be noted. In Part 1, the 
last paper is not in its proper chronological order (also true of the tenth 
paper in Part 3) and the fourth paper appears to have been better placed in Part 
2. A further quibble is in regard to the seventh paper in Part H as with a few 
other papers — they could just as easily have been placed in other parts. 

It would be presumptious to believe that these works exhibit the totality of 
Box's contributions because obviously his career is far from over. In this 
sense the "Collected Works" is premature since by no means have his research 
efforts abated. In view of this fact the editors missed the rare opportunity of 
allowing the scientist to comment on his own work, discussing what he believed 
most important and perhaps how he arrived at some of his Ideas and the 
connection between works that may seem to us quite disparate. This would also 
have permitted him to correct mistakes, misprints, etc. of one kind or another 



in the text. The articles are photocopies of the originals and hence large 
variations in font, texture and typography are evident, 

1. The papers in Part 1 are preceded by an erudite summary as only S. M. 
Stigler can render. He traces the growth of Box f s original conception of 
robustness to his more mature views later on (7*, 13 f 22). This certainly needs 
to be contrasted with the current industry it spawned, I recall, when first 
hearing about someone who claimed his method or analysis was robust, picturing a 
portly gentleran of the Colonel Blimp variety oblivious to all that one might 
learn from the data keenly intent only on the fact that his nominal significance 
level was approximately correct. Later on, when the frequentist industry had 
sufficiently proliferated, I imagined a procedure to be robust if it could find 
the center, whatever that might mean, of any of a set of differing, perhaps 
misshapen, fat-tailed cheeses, replete with varied sized holes. But it is 
informative to understand Box's original view and how it progressed. He simply 
stated that a statistical criterion that retained its sensitivity to changes in 
the factor of presumed interest but was more or less impervious to extraneous 
perturbations was robust and on that account useful. This came to be known as 
criterion robustness. There is a presumption in this view that irrespective of 
the perturbations the criterion retains its factor of interest *- a fact that 
may not be the case. 

Later on he introduced a Bayesian view of robustness (24, 28, 43) wherein 
one assesses the sensitivity of an inference about a parameter 6 conditional on 
a "grudging and judicious elaboration" of the current mocel to a set of models 
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wherein 6 retains its physical interpretation. In Box's example the standard 
model is neatly encapsuled in a larger one by the introduction of a discrepancy 
parameter 6 , which for a particular value B - 8 Q , yields the original model. 
The posterior distribution of e conditional on 8 and data D - (yjt....y n ) may be 
examined to determine the effect of varying the discrepancy paiameter. When the 
effect is minimal, the older model could be retained and the inference 
"Judiciously » subsumed in P(e|D, B Q ) i.e., based on the usual model since this 
distribution will represent a more parsimonious description and generally a 
tighter set of high probability values for e. However when this is not the 
case, robust estimation is n grudgingly w provided by the marginal posterior 
distribution P(e|D) - E g P(e|D,B) . Box had now removed the board from under 
those agile surfers who frequent every new wave. 

In commenting on Box's Bayesian Robustification, Barnard (1980) noted that 
when the effect is minimal, one had a robust sample. In terms of likelihoods, 
if 

U8 2 lD,8) " 

varied little with alternative values of 8 for almost all admissible pairs 
(O-jtOg), then the sample represented by D was robust. 

Similarly in a Bayesian context, if for all interesting 9 

sup|P(e|B 0 ,D) - P(6|B.D)| < 6 
B 

for example, where 6 is n small n enough to suit one's purpose, then D should 
qualify as a robust sample with respect to the estimation of 8. Further and 
most importantly, if Y is either a future observable or some function of a set 



of them from the process that generated the data D, and again if for all 
interesting 8 

3up|P(y|B ot D) - P(y|B,D)| < 6, 
6 

then D is a robust sample for the prediction of ::. Note that we could qualify 

this by using only particular values of y because there may be situations where 

our interest is very specific. For example our interest might be focused on the 

computation of the chance that Y will exceed a specific threshold value y t say. 

t 

It could turn out that for a specific set of values of y e y under interest 

P(y|B 0 ,D) i P(y|B t D) for all 6, 

then for this purpose D is specifically robust. However' for other values of y 
this may be far from the case and D would not be completely robust. Clearly at 
y - + • t D is always specifically but meaninglessly robust. 

I stress predictive robustness here because of its potential for being 
different from parametric robustness in certain cases; i.e., lack of parametric 
robustness need not imply lack of predictive robustness. Further if e - (tut), 
the sample could be robust with respect to n but not with respect to t, and it 
may or may not be robust with respect to y. Not being robust with respect to y 
is however a clea** indication of the failure of robustness with respect to e. 
The property of predictive robustness is t I believe, the most important and 
useful of all robustness criteria. A more encompassing notion of a predictively 
robust sample would involve a prescribed lack of variation when both the prior 
and the likelihood (i.e., the entire model) are jointly perturbed from some 
standard. 
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Tracking Box's path through robustness is in many ways similar to tracing 
through his other research concerns . They are informed by his changing but 
flexible inferential philosophy. They all begin with an adherence to 
frequential theory (3, 7, 11 t 12), then succumb to the influence of Fisherian 
ideas (in this case permutation or randomization tests (13)), and finally 
conform to a rather flexible Bayesian approach, (2**, 32) and Box and Tiao 
(1972), as he becomes more enmeshed in real technical and scientific problems. 
His latest efforts appear to reflect the fact that he now is being persuaded by 
the value of predictive arguments in research work. This is demonstrated by his 
more recent opus (43) where he used the predictive distribution for criticizing 
a model* 

That Box did not earlier recognize the value of predictive distributions 
perhaps stems from the fact that he dealt largely with problems in the physical 
sciences where the error was not as inherent in the sampling unit as in the 
biological and social sciences^ By this I mean that in many (not all) problem* 
in the physical sciences, meaningful physical entities can be established and 
presumeably when enough important factors are included the error that remains is 
to a large degree a function of the measuring device. In the biological and 
social sciences the material under investigation is subject to inherent 
variation irrespective of the accuracy of measuring instruments and hence 
requires a much scronger emphasis on observables and inferences about them. Box 
enjoys a middle ground between these extremes. He asserts that in a 
relationship n - ♦(x 1 ,...,x p ), all physical variables, say, where the x f s are 
controllable and measurable essentially without error, there is still "error" 
involved in the response variable for repetition of the experiment at the same 
set of x's not due to inadequate measuring devices, but to uncontrolled factors. 
Or, to put it another way, it is due to the imperfection of the postulated 
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model. The relationship + Is governed by a set of parameters $ Q , 8j,...,8 k . 
Since we have the capability or potentiality of making as many observations n 
at all values of the x's within our interest , ad infinitum, both the parameter 
representing experimental error and the set B Q , 6 1 ,«««»6 k may have some 
physical meaning as representing constants of the tentative model depending of 
course on its approximative value. The fact that there Is still experimental 
error even In the presence of n perfect* measurement reflects the fact that the 
model Is only approximately adequate. 

Thlj situation is generally different in the softer sciences even where 
measurements can be made essentially without error , but relationships are either 
vague, complex or completely unknown. 0*ten the data consists of the varied 
responses of a sample of Individuals, and our Inference Is to some aspect of the 
population of potentially observable Individuals that our sarnie represents — 
which is invariably finite. We may be Interested in the response of a randomly 
selected new individual from this finite population or some function of the next 
M in a future sample,. e*g. the fraction that lie in a certain Interval. 

Even In the cases that absorbed Box's attention an Important way for 
achieving better models is to compare their predictive capacities. As he sooner 
or later came to realize, convent id hypothesis testing was Inadequate to cope 
with this issue. Sorting amongst rival models was much more sensibly treated In 
the Bayeslan framework; but this would always require a great deal of prescience 
concerning the totality of alternatives to be entertained and one's prior 
probability about the potential truth of each of them. Few serious scientists 
appear to work that way. Box then essentially fused a Bayeslan predictive mode 
with current scientific operating procedure to entertain a provisional model as 
the source for the generation of a data set when no alternative appeared as yet 
on the horizon. This would be checked or criticized by a predictive 
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significance test which calculated the probability of the set consisting of 
those points in the sample space whose probability density was no larger than 
the observed sample, rather than some ordering of the sample points as to their 
discrepancy from the hypothesis. The latter being in line with the more 
standard Fisher-Barnard-Cox view of significance tests. Box's procedure (*3) 
though leading to logical conundrums, c.f. Gelsser (1985), is not rescued by 
ordering sample points as Cox (1980) suggested. It is nevertheless an excellent 
operational procedure and will make no less progress than a logically perfect 
one that is wither too difficult to apply or is in peril of being seriously 
misapplied. 

In other words we have now a further extension of "Robustness" — whether 
Box intended it or not. Just as a robust analysis is one that can resist most 
perturbations that might occur, so it is with Box's predictive model criticism. 

What has been always somewhat curious is the fact that Box had not stressed 
predictive distributions for inference about observables. II appears to me that 
his whole Bayeslan philosophy is oriented in that direction. An example of this 
is his views on prior distributions of parameters which he claims ought to 
depend on the experiment, in other words the Boxlan model parameters in many 
instances are fathered by the experiment and may not have a physical existence 
outside of the experimental setup. Then rightly, the prior distribution of such 
a "parameter" may depend on the likelihood the experiment induces. Why then 
should it be of primary interest to draw a conclusion about such an entity which 
may be totally artif actual or at best whose real status may be murky, rather 
than a potential observable or some function of a future set of them? The 
"parameter" only comes to life, as it were, as a limiting value of an 
interesting function of potential observables. It is curious that Box's extra- 
ordinary sense of experimentation and understanding of models has not liberated 
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him further from being engrossed w!th the estimation of these entities. 

A response that he might make, not unjustifiably, is that he is interested 
in a series of experiments whereby a theory, or more likely a process, which was 
inadequately understood at first could, after several iterations, be much better 
understood* . This would then lead to a sensible working model which would cover 
the main theoretical aspects of the problem* Without in anyway disagreeing with 
such a laudible enterprise (aptly explained in a variety of appealing diagrams 
in several of his papers), I would point out that, at least, when such a model 
was established, then its main purpose was the prediction or even the control or 
regulation of observable*. Also, during the iterative process there was ample 
scope for the use of predictive distributions and predictive sample reuse 
techniques for assessing the adequacy of the succession of provisional models. 
Just as our hierarchical Bayesian colleagues must at some point cease their 
regress in prior hyper parametric assumptions, so too dees this iterative 
modeling scheme need to be put to use. On the other hand there is no quarreling 
with success and unarguably Box is an eminently successful statistical 
scientist. 

2. The second part is introduced by B.H. Margolin who, in a careful and 
thorough manner, explains Box's explorations of response surfaces. 

Boxian experimental designs grew out of and then departed from its Fisherian 
roots because of the necessity to accomodate differing scientific and technical 
needs. Rather than assessing the effects of various factors in a multi- 
factorial comparative trial, the problem Box faced w&s to devise schemes for 
efficiently determining the optimal conditions for the output of some industrial 
process which depended on k controllable quantitative variables. Here instead 
of designing a single experiment, we have a sequence of such trials each 
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depending on its predecessors* Standard factorial designs were employed for 
estimating first derivatives In given subreglons which would Indicate which 
further subreglons to explore* This steepest ascent method Is continued until 
the vicinity of a stationary point is achieved ~~ hopefully one yielding an 
optimal response* For a more detailed exploration of this near stationary 
region entirely new "composite" designs were devised to estimate higher 
derivatives* The two sources of error inherent in such studies are measurement 
error and the bias due to whatever difference there is between an assumed 
response function and the actual one* The accuracy of the estimates of the 
derivatives will be determined by the arrangement of the experimental points* 

Box investigated the optimal estimation of the constants of a planar 
regression surface depending on the k quantitative controllable variables 
subject to homogeneous measurement error where N > k combinations of the levels 
of the variables would be chosen* He showed that the minimum variance property 
for the optimal design is invariant under rotation* This property can be used 
to reduce bias, eliminate certain systematic effects without losing efficiency 
and allow the usual normal theory tests to be exact , Independent of the 
distribution of the observations by a suitable randomization scheme* The basic 
ideas of response surface designs and methodology were exploited, developed and 
extended in a series of papers (5. 6, 9. 10i 14, 16. 17, 18, 19, 20, 21, 26, 30) 
to include non-linear response functions, Bayesian design criteria and the 
accomodation of potential bias in an assumed response surface by the construe* 
tlon of appropriate designs* Margolin properly intimates that Box's work on 
response surface methodology was, by itself, sufficient to keep a statistician 
of the first rank busy for a quarter of a century* The same, of couse, may be 
said and is for at least several other of these research areas that were 
developed during that same period* 
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3« Box's efforts in time series are succintly summarized by C.V.J. Granger, who 
remarks that Box's early vork f brought together and amplified in his "landmark" 
book, Box and Jenkins (1970) f "had a widespread, immediate and dramatic effect 
on the modeling and forecasting of time series." And so it did. This book 
revised and expanded in a second edition (1976) included the bulk of their 
output in this area up until that time. 

An appealing feature of the work was their efforts at building parsimonious 
and applicable stochastic models for a sequence of dependent discrete 
observables in the time rather than the frequency domain. (Box rightly 
emphasizes observable Inference rather than spectral inference, or more bluntly 
the substance rather than the spectre.) This would enable one to investigate 
the process underlying the series or at least provide the simplest flexible 
smoothing function that best represented the series consonant with whatever one 
knew about the process. Optimal forecasting of future observables from such a 
series would then naturally follow. Tt ^RIMA models developed and employed by 
Box, capable of handling non-stationary and seasonal time series (23, 31, 34, 
41), were extended to represent relationships between several such series with a 
view towards simultaneous forecasting and forecasting future values using its 
previous values (42, 45) and those of a related series. Procedures were devised 
for fitting and checking transfer function models and designing optimal control 
schemes (35» 37). 

In pollution problems a new situation arose whereby the effect of some 
external shock (in this case a pollution control law) needed to be taken into 
account in the time series. Thus arose Intervention Analysis (29, 37, 38, 39). 

Throughout his work Box emphasized the importance of the iterative strategy 
of model identification, efficient estimation of the model parameters, and 
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assessing the adequacy of the model's fit by means of diagnostic checks . If the 
fit were demonstrated to be inadequate , one looped through this process 
recursively until one attained a model suitable for forecasting (or control). 
To Box this was the scientific method, just as the search for an optimum of a 
response surface required sequential alteration of design was the scientific 
method* A difficulty, if the procedure is used incautiously or robot-like, is 
the possibility of being unable to extricate oneself out of an interminable 
loop, for example when the ARIMA program is deficient for the task. 

His development of the various theories and methods for time series was only 
exceeded by what was perhaps his principal achievement in this area. This 
turned out to be the superb organization of all these components into a coherent 
program that indelibly marked his work. Of course the fact that, in most 
instances, provision was made for a ready availability and easy implementation 
of his efforts, was no doubt responsible for much of the pervasive popularity it 
enjoyed. 

What might there be to criticize in this program? At least two possibil- 
ities strike one. Instead of attempting to pay attention to the actual 
mechanism in the time series one resorts to essentially black box (ARIMA-BOX) 
techniques which represent a lower order of scientific inquiry. In many cases 
this is not a serious objection because the underlying mechanism is often either 
so ephemeral or so complex (certain economic time series, for instance) as to 
defy discovery in time for any appropriate inference, decision or action. 
Further, the crucial issue for Box was forecasting (or control); and if that can 
be accomplished efficiently or near optimally by these techniques, that should 
suffice for most practical purposes. 

A second criticism is one that entirely permeates this review and is most 
pronounced in this part primarily because it underlies both ultimate goals of 
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time series analysis — forecasting and control. Where are the predictive 
distributions of future values? At best we are given "predictive" means and 
variances or distributions on the assumption that all the parameters are known - 
~ this is not sufficient. Is Box not a Bayesian that his inferential posture is 
incapable of such results? Not so! Although a good deal of Box's time series 
work was in the classical frequentist estimative mode he also provided posterior 
distributions of real and imagined parameters in no small measure. 

As previously indicated 9 he did come around in more recent times to 
considering predictive distributions but mainly for model criticism and not for 
what they are best designed for *• actual prediction 9 inference 9 comparison 9 
decision 9 etc. I have no alternative but to infer that he kindly left something 
undone so that others might enjoy the effort in completing the task-- or he is at 
it now. 

4. The set of papers in Part 4 is a potpourri of Box's work in the derivation 
of distributions 9 transformations 9 functional relationships and non-linear 
models. In a lengthy and effusive introduction 9 I. Guttman presents a 
comprehensive summary of the 13 papers in this part. 

Several of Box's most important papers appear here. For example, the first 
one (3) derives the distribution of a class of modified likelihood ratio test 
statistics occurring in multivariate analysis whose moments are of general 

specified form. He shows that the distribution function can be written in terms 

2 

of an asymptotic series involving x distributions of successively higher 
degrees of freedom. The coefficients are such that one need only use a suitable 
number of terms to calculate the tail probability of the statistic with 
sufficient accuracy under the null hypothesis. 

The next two papers 9 representing half of the selected papers that Box 
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published in the Annals of (Matheaatieal) Statistics and a third of the six 
papers out of his total of 120, (one wonders if this is not a statement about 
one of our most "prestigious" Journals or G.E.P. Box which I leave the reader to 
decide) deal with theorems on the distribution of quadratic fo-ms and their 
application to the null distribution of the F-test in the analysis of variance. 
His particular interest here is to determine the effect that departures from 
certain standard assumptions have on the test. He shows that moderate variance 
heterogeneity has only a modest effect for equal group sizes in one way analysis 
of variance situations but that unequal group sizes have a larger effect. In a 
two way analysis of variance he showed that serial correlation between the 
columns can induce large discrepancies in the nominal significance level for 
testing the equality of column means. 

An extension of Box's work to repeated measurement designs involving several 
groups was made by Geisser and Greenhouse (1958). They suggested an F-test 
whose degrees of freedom were reduced by the appropriate factor e calculated 
from Box's work, which then was estimated from the sample covariance matrix 
among the repeated measurements assumed to be multivariate normal. This e F- 
test continues to serve as a popular alternative to a full scale multivariate 
test collier et al (1967), Wilson (1975), and under most reasonable alternatives 
it has greater power, it has also been shown to be useful in growth curve 
situations even with incomplete data, Schwertman (1978) and Schwertman et al 
(198<0. The estimate e has also recently been shown to provide a locally best 
invariant test of whether the standard analysis of variance F-test in repeated 
measurement situations is appropriate - Grieve (1984) . 

Box returned to this work a quarter of a century later to analyze analysis 
of variance situations with autocorrelated observations using a Bayesian 
approach (44). 
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A series of papers on transformations involved a response modeled as 
- f(x,8) ♦ e i for x a set of known variables and 9 a set of unknown 
parameters that can be fitted by least squares when the e^ were independently 
and normally distributed with constant variance. His first efforts were 
transformations on the set x, in terms of powers, logs etc. to reduce the 
function f(x,e) to a? simple a form as possible (25), usually a linear 
function when the true relationship was unknown. 

Soon thereafter came the famous Box-Cox paper (27) in which non-linear 
transformations were examined for the elements of the vector 




for A - 0 



such that for some unknown A, 

E(Y <X) ) - X8 ♦ e , 



where X is a known matrix and 6 a vector of unknown parameters and the vector 
e - MVN(0,o 2 I). 

Maximum likelihood and Bayesian methods were presented for the estimation of 
A. Once A was established, a standard analysis conditional on that value might 
proceed. A second Box-Cox offering (46) in this vein was a rebuttal to a paper 
by Bickel and Doksum (1981). The latter showed that the joint estimates A and 6 
can be highly correlated even when the error variance was small, so that the 
marginal variances of 8 can be considerably larger than the conditional 
variance given A. Box and Cox argued that when e depends on A and A is poorly 
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determined, the units in 8 are not comparable. It seems to me that the whole 
brouhaha revolved around the wrong issue* The proper focus should have been the 
effect on the prediction of Y after transforming back from Y* X) so that a 
standard metric is established for comparison. In such situations the most 
important issue is the prediction of future values of Y, since hardly anyone 
would believe in the mechanistic validity of models that are simultaneously 
linearized, normalized and homoscedastisized with rather bizarre powers of Y*- 
unless there were unimpeachable scientific reasons for their acceptability. 
This kind of modeling is basically a convenient way of reasonably approximating 
relationships that possess adequate predictive power. 

5. The last part features 10 papers on the application of statistics. The 
informative introduction by R.D. Snee extolls Box's expository clarity. We are 
also informed that with regard to statistics Box was an autodidact, and this, in 
conjunction with his masterly facility for presenting graphical paradigms, was 
mainly responsible for his formidable skill as the great statistical 
communicator. So much for his formal training (B.A., Ph.D.) at University 
College, London. 

As a result of a war-time project several papers (1, 2), displaying his 
practical statistical acumen, deal with the effects of phosgene and mustard gas 
on laboratory animals. Another paper (8) involves an investigation of an 
automatic machine for testing pigment strength in the chemical industry, and a 
duo of papers (38, HO) detail statistical studies of Los Angeles smog data 
wherein he popularized Intervention Analysis, used to analyze the effect of the 
occurance of an event on a time series. This was a topic (without the 
felicitous term) he had studied theoretically some 10 years previously (29). 

In still another paper (36), he leans heavily on his involvement with 
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environmental problems such as smog to offer his views on scientific 
experimentation by diagramming the flow and interaction of hypotheses, models, 
experimental data, deduction, induction, etc. I was struck here by his near 
analogy of the almost perfect correlation between storks 9 nests and human births 
with the "correlation" between smoking and lung cancer. This was an uncalled 
for comparison and I trust that after some reflection he no longer seriously 
believes, if he ever did, that both "relationships" enjoy equivalent evidential 
standing . 

A paper (15) on "Evolutionary Operations," or EVOP another Boxism patterned 
on natural selection, presented a very sensible way to run a manufacturing 
process, or to quote Box, "A process should be run as to generate product plus 
information on how to improve the product." Perhaps even more generally he 
might now add "and to implement the improvement based on a cost-benefit 
analysis". At present, natural selection is being altered to take account of 
"Punctuated Equilibrium," so that one can only wonder if "Punctuated or perhaps 
Punctuated Interventional Operations" is the new order of the day. (PIOP, one 
hopes is not restricted to the sky.) Designed fo;« manufacturing processes, EVOP 
has not much influenced statistics or statisticians, which probably is a source 
of chagrin to its creator. It's actual impact on industry is unknown to me. 

His paper (with less fanciful appelations) on Growth and Wear Curves (*»), on 
the other hand, is probably one of the more heavily cited in the statistical 
literature. Here he initiated the practical groundwork for analysis of variance 
problems involving repeated measurements and growth curves and followed this up 
later with theoretical work on the distribution of quadratic forms which was 
previously discussed. 

6. Unlike mathematicians, many distinguished statisticians such as Box often 
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take some time to attain their full creative powers. Early in their careers, 
there is a fair display of technical virtuosity. Later, they develop deeper 
insights into important statistical issues. Some are led into ph.* '-^ophical 
paradigms for the foundations which at least overtly they believe to be 
unflawed. Being without blemish may require being devoid of relevance, or if 
relevant, impossible to execute. Indeed, the bust of statistical ideas often 
engender disturbing paradoxes or counterexamples that render them suspect as to 
their capability of being generalized into a fault-free inductive system. The 
history of the logical foundations of statistics is replete with such failed 
panaceas. 

For Box, a preoccupation witn foundational issues was never an overriding 
concern, although he couid very vigorously and trenchantly defend his eclectic 
view which combined such disparate notions as: the Bayesian approach; prior 
distributions depending not only on the likelihoopd, Box and Tiao (1973); and 
not only on the sample size, Box and Tiao (1968); but even on the observations 
themselves (27); randomization and permutation tests (13, 30); predictive tests 
of significance (i»3); and the concept of power (3, 7, 22). Attacked from all 
sides of the ideological fence he, in an apian manner, disarmed friendlier 
critics with mellifluous argument and subdued hostile ones with stinging wit. 

His work indicates that he was quick to perceive what the important 
practical problems were and readily devised informative statistical paradigms 
accompanied by sensible (not necessarily final) solutions for them, uninhibited 
by a strict adherence to any one inferential ideology. Hence the principal 
characteristic of the Boxian approach is best summed up as pragmatism par- 
excellence. 

Our perusal of Box's contributions indicated simultaneous progress on many 
different fronts with the bulk exhibiting a rare combination of theoretical and 
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methodological aspects geared towards solving problems in technology and applied 
science. One recent feature discerned was an increased emphasis on predictive 
distributions and the analysis of observables. if we add to this his evident 
interest in EVOP it would not be too surprising to see Box getting involved in 
areas such as quality assurance , manufacturing engineering, automated processes, 
experimental therapeutics, and adverse drug reactions using and expanding such 
notions as control, regulation, feedback and near optimization. An excellent 
though limited initiative into a few of these aspects using observable or 
predictive analysis appears in the work of Aitchison and Dunsmore (1975). 

Once the difficult tasks of understanding the reasons and purposes for the 
collection of a data set and formulating an appropriate model are completed, the 
principal job of the statistician is calculating probabilities of observables 
that are unknown, conditional on known observables. The wave of the future in 
statistics is in calculating relevant probabilities for the future. 
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